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We present a measurement of the ti cross section using high-multiplicity jet events produced in pp 
collisions at ^J~s~ — 1.96 TeV. These data were recorded at the Fermilab Tevatron collider with the DO 
detector. Events with at least six jets, two of them identified as b jets, were selected from a 1 fb _ 
data set. The measured cross section, assuming a top quark mass of 175 GeV/c 2 , is 6.9 ± 2.0 pb, in 
agreement with theoretical expectations. 

PACS numbers: 14.65.Ha 



I. INTRODUCTION 



The top quark is the most massive fundamental parti- 
cle ever observed. Its mass, m t = 173.1 ±1.3 GeV/c 2 Q, 
is approximately twice that of the next heaviest elemen- 
tary particle, the Z boson, and is approximately 35 times 
that of its weak-isospin partner, the bottom quark. Top 
quarks are primarily produced in pairs at the Fermi- 
lab Tevatron pp collider via the qq ~ * ti (w 85%) and 
gg — > ti (ss 15%) quantum chromodynamic (QCD) pro- 



cesses. They decay to a W boson and a b quark with 
a branching fraction near one according to the standard 
model (SM). The W boson subsequently decays into a 
lepton and a neutrino or into a quark- antiquark pair. 
The decay products of the W bosons are used to clas- 
sify the top quark decay channel. The all-hadronic decay 
channel, with a branching fraction of 46% 0], has a fi- 
nal state containing two b quarks and four lighter quarks 
and is shown schematically in Fig. [T] The top quark 
might also decay into non-SM particles (e.g., a charged 
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Higgs boson) and the decay products of these new par- 
ticles can change the branching fractions of the leptonic 
and all-hadronic it decay channels 0] ■ Comparing the it 
production cross section between different decay channels 
directly constrains the existence of beyond the standard 
model particles lighter than the top quark. 

In this paper, we present a new measurement of the 
pp — > ti+X cross section using events containing at least 
six jets, two of them identified as b jets. The data sample 
corresponds to w 1 fb _1 acquired by the DO experiment 
at a center-of-mass energy y/s — 1.96 TeV. DO previously 
published a measurement of the it cross section in multi- 
jet events with 0.4 fb _1 of integrated luminosity and ob- 
tained 4.5±?;g (stat.)ti.t (sys.) ±0.3 (lum.) pb [4]. CDF 
published a similar measurement with 1 fb _1 and ob- 
tained 8.3±1.0 (stat.)lls (sys.)±0.5 (lum.) pb % Both 
measurements assumed m t — 175 GeV/c 2 and agree with 
the cross section measurement presented in this paper 
and with the SM expectation of 6.901°;^ pb 0,0. 

The dominant source of background in the all-hadronic 
channel is QCD multijet production. Rather than rely- 
ing on event generators such as pythia HERWIG @, 
or ALPGEN [1(| to reproduce all characteristics of events 
with six or more jets, we instead derived a background 
sample from the triggered data (Sec. IIII B[) . The back- 
ground was suppressed compared to signal by requiring 
at least two of the jets be identified as b jets (Sec. HIE]) . 
The it signal was simulated by the ALPGEN event gener- 
ator that used pythia with the tune A [llj parameter 
settings for the parton shower, hadronization, and un- 
derlying event aspects. Kinematic selection criteria were 
applied to further improve the signal-to-background ra- 
tio to approximately 1 : 7 ( Sec. IIII Dj) . The it production 
cross section was extracted using signal and background 
templates for a likelihood discriminant constructed from 
topological and kinematic observables. (Sec. IIV[) . 



proton 



antiproton 




FIG. 1: Dominant Feynman diagram for it production in the 
all-hadronic decay channel. The t decays into a W + b and the 
W + decays into either ud or cs (represented by the q and q' in 
the figure); the i and W~ decay into the charge conjugates. 
The event signature consists of two b jets and at least four 
other jets. 



II. DETECTOR AND RECONSTRUCTION 

A. Detector 

The DO detector [l2| has a central-tracking system con- 
sisting of a silicon microstrip tracker (SMT) and a cen- 
tral fiber tracker (CFT), both located within a 2 T su- 
perconducting solenoidal magnet, with designs optimized 
for tracking and vertexing at pseudorapidities \rj\ < 3 
and 1 17 1 < 2.5, respectively |T3[. Central and forward 
preshower detectors are positioned just outside of the 
superconducting coil. The liquid-argon and uranium 
calorimeter has a central section (CC) covering pseudo- 
rapidities \rj\ < 1.1 and two end calorimeters (EC) that 
extend coverage to |t?| w 4.2, with all three housed in 
separate cryostats [14| . Each calorimeter contains a four- 
layer electromagnetic (EM) section closest to the inter- 
action region, followed by finely- and coarsely-segmented 
hadronic sections. Scintillators between the CC and 
EC cryostats provide sampling of developing showers at 
1-1 \v\ ~ 1-4- The luminosity is measured u sing scin- 
tillators placed in front of the EC cryostats [lq . An 
outer muon system, covering \rj\ < 2, consists of a layer 
of tracking detectors and scintillation trigger counters in 
front of 1.8 T iron toroids, followed by two similar lay- 
ers beyond the toroids. The trigger and data acquisition 
systems were designed to accommodate the high lumi- 
nosities of Tevatron Run II. 



B. Trigger 

The events used in this analysis were collected using a 
multijet trigger. The first level of the trigger used ded- 
icated hardware and preliminary information from the 
calorimeter to identify multijet events. This selection was 
refined in a second level with more complex algorithms. 
The third trigger level employed a fast reconstruction of 
the event with a simple cone jet algorithm [1 61 ] - This se- 
lection was further refined using the final reconstruction 
algorithms which included the midpoint cone jet algo- 
rithm [16| | . Kinematic and jet multiplicity requirements 
were applied at each stage to reduce the overall data rate. 

The trigger required at least four reconstructed jets. 
The specific requirements on the jets, particularly the en- 
ergy thresholds, were changed several times during data 
collection to cope with the increasing instantaneous lumi- 
nosity delivered by the Tevatron. Efficiencies were mea- 
sured independently for each trigger epoch and combined 
together weighted by the integrated luminosity of each 
epoch. Rather than correcting the data for inefficiencies 
in the trigger, the simulated it signal was weighted by 
the trigger efficiency. The average trigger efficiency for 
it signal events that passed all selection criteria used in 
this analysis is shown in Fig. [2] as a function of Ht where 
Ht = ~^2pt over all jets and px is the transverse momen- 
tum of a jet. The background sample was created from 
the triggered data (see Sec. IIII Bp and therefore need no 
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FIG. 2: Average trigger efficiency for simulated all-hadronic 
ti events as a function of Ht- The untriggered tt Ht distri- 
bution, normalized to unit area, is also shown (scale shown 
on the right.) Displayed error bars represent statistical un- 
certainties only. 
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FIG. 3: The distribution of the primary vertex z position with 
respect to the center of the detector in the triggered data. The 
solid line is a fit to the region with \zpv\ < 35 cm, while the 
dotted line is an extrapolation of the fit outside that region. 
Displayed error bars represent statistical uncertainties only. 
The distribution is normalized to unit area. 



additional corrections. 



C. Tracks and Vertices 

Tracks were reconstructed from hit information in the 
SMT and CFT. The location of the hard-scatter interac- 
tion point was reconstructed by means of an adaptive pri- 
mary vertex algorithm [UHII]. Only vertices constructed 
from at least three tracks were considered in this analy- 
sis; O(40) tracks are associated, on average, with primary 
vertices in simulated all-hadronic tt events. A distribu- 
tion of the location of primary vertices along the z axis 
in triggered events is displayed in Fig. [3) The primary 
interaction vertex was required to be within 35 cm of the 
center of the detector along the z axis to keep it within 
the fiducial volume of the SMT [l8[ . The distribution in 
Fig. [3] was fitted within the |zpy| < 35 cm range with the 
sum of two Gaussians. The fit extrapolation outside this 
range is also shown. The total primary vertex acceptance 
was 79.5 ± 2.0%. 



D. Jets 

Jets were reconstructed from energy deposits in 
calorimeter cells using the Run II midpoint cone algo- 
rithm (l6j with a cone radius K = ^(A<J)) 2 + (Ay) 2 = 
0.5 [lH. Noisy calorimeter cells were suppressed by only 
including cells that had energies > 4cr above the aver- 
age electronic noise and that also had adjacent cells with 



energies > 2a above noise. Jets were required to have 
< 40% of their energy in the coarse hadronic calorime- 
ter, have at least half the remaining transverse energy 
matched to energy depositions identified by the hardware 
trigger, and have between 5% and 95% of their energy in 
the EM calorimeter. These requirements were for jets 
reconstructed in the CC; they were looser at forward ra- 
pidities. 

Jet energies were corrected for the energy response of 
the calorimeter, for the effect of particles showering out- 
side the jet cone, for overlaps due to multiple interac- 
tions and event pile-up, and for calorimeter noise [20| . 
The calorimeter response was measured using the px im- 
balance in 7+jet and dijet events; the response of the 
calorimeter to electromagnetic showers was calibrated us- 
ing the Z — > e + e~ mass peak and a detailed accounting 
of the material between the calorimeter and the interac- 
tion point. The jet energy calibration also used Z+jet 
events and events acquired using low bias triggers. Jets 
that contained muons, assumed to originate from c- or b- 
hadron decays, were corrected to account for the energy 
of the muon and the accompanying neutrino. Muons with 
Pt > 60 GeV/c were treated as having pt = 60 GeV/c 
to avoid the impact from poorly reconstructed muon mo- 
menta. Jet energies were calibrated independently in the 
data and in the simulation using the same methodology. 
Jets in the simulation required additional corrections to 
reproduce the reconstruction efficiency and energy res- 
olution in the data. The uncertainty on the jet energy 
calibration is w 1.5%. 

Jets were further required to be matched with at least 
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FIG. 4: Comparison of taggability fraction in selected mul- 
tijet data after selection with that in the ti simulation as a 
function of jet pr- Displayed error bars represent statistical 
uncertainties only. 



two good quality tracks having px > 1 GeV/c and 
Pt > 0.5 GeV/c, respectively, that included SMT hits 
and pointed to the primary vertex. These requirements 
are termed "taggability" and are important for identi- 
fying heavy-flavor jets (Sec. Ill E|) and to reject jets pro- 
duced by overlapping pp collisions. The taggability frac- 
tion depends nominally on the jet pr, jet rapidity, zpy, 
sign(zpy x ?7jet) x |zpv|, and the flavor of the jet [18| . 
The fraction of jets that were taggable was measured us- 
ing the selected sample of multijet events fSec. lIII Dl) and 
is shown in Fig. 3] binned in jet px- Differences between 
the taggability determined with multijet data and with 
the ti signal simulation could bias the cross section mea- 
surement. The ti simulation yielded the same taggability 
fraction as a function of jet px and r] as the multijet data 
within the statistical uncertainties (Fig. The uncer- 
tainty on the relative difference between data and simu- 
lation is 2% and is dominated by the limited statistics in 
the comparison. 



E. b Jets 

Jets that contain a b hadron are called "6 jets" as they 
typically originate from a b quark, b hadrons have rel- 
atively long lifetimes and so usually travel several mil- 
limeters before they decay. Secondary vertices, displaced 
from the primary vertex, are usually formed by the tracks 
associated with the decay products of the b hadron. 

An artificial neural network (NN) was used to iden- 
tify b jets [2l[. Selected characteristics of secondary ver- 



tices and tracks associated with b hadron decays were 
used as inputs to the NN. These included aspects of the 
secondary vertex such as its decay length significance, 
goodness of fit, number of tracks, mass of the system 
of particles associated with the vertex, and the number 
of secondary vertices found in the jet. Additionally, the 
weighted combination of track impact parameter signif- 
icances and the probability that the jet originated from 
the primary vertex were also input into the NN. 

The probability to identify a b jet, the tag rate func- 
tion, was measured in data and parametrized as a func- 
tion of the jet pt and r\. Similar functions were de- 
termined for charm jets. The fake rate, the probabil- 
ity to assign a b tag to a non-6 jet, was dominated by 
light jets and long-lived particles (e.g., K®, A ). The b- 
tagging efficiency is (57 ± 2)%, the tagging efficiency for 
charm is (15 ± 1)%, and the fake rate is (0.57 ± 0.07)% 
for the NN output threshold used in this analysis at 
p T = 40 GeV/c (U. 



III. ANALYSIS TECHNIQUES 

A. Data Sample 

The data used for this analysis were collected between 
August 2002 and February 2006 with the four-jet trig- 
ger described in Sec. Iff Bl Quality requirements were im- 
posed on the selected data; runs or parts of runs in which 
detector systems essential to this analysis had problems 
or significant noise were discarded. The integrated lu- 
minosity of the data sample, including these trigger and 
quality requirements, is 0.97 ± 0.06 fb -1 . 



B. Background Model 

QCD multijet events that have at least two heavy- 
flavor jets are the dominant source of background to ti 
production in the all-hadronic decay channel. This large 
background is distinguished from the ti signal by exploit- 
ing differences between the kinematic and topological dis- 
tributions of jets in ti and multijet events. Correlations 
between jets, particularly for b jets, must be reproduced 
for the observables used in this analysis. 

The background sample was created using triggered 
data events. Signal contamination in the background 
sample was minimized by selecting events with two b- 
tagged jets and low jet multiplicities. Samples of events 
with at least four taggable jets having pt > 15 GeV/c 
were selected from the triggered data. The 6-jet iden- 
tification criteria described in Sec. Ill El were applied to 
these samples; events were kept if there were at least two 
tagged jets. The background sample was then created 
by attaching \ow~pt jets selected from events with six or 
more jets to events with four or five jets. A reasonable 
distribution of the jets in the available phase space was 
ensured using a set of matching criteria. 
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FIG. 5: AR between the two leading 6-tagged jets in 4-jet and 5-jet events with (a) pr > 15 GeV/c; (b) pr > 40 GeV/c. The 
peak near AR ~ n is dominated by direct bb production while the peak near A_R w 1 (twice the jet radius) is mainly g — > bb. 
Displayed error bars represent statistical uncertainties only. Distributions are normalized to unit area. 



One concern with basing the background distributions 
on a lower jet-multiplicity sample was that the relative 
contributions of different production diagrams might de- 
pend strongly on jet multiplicity. This was tested by 
examining distributions of the AR between the b jets. 
We expect a peak near tt for bb produced in 2 — * 2 hard 
scatters, whereas we expect a peak near one (twice the 
jet radius) for bb produced via gluon splitting, g — > bb. 
This is illustrated for four and five jet events in Fig. 
Figure 02a) shows Ai?^ for b jets with pr > 15 GeV/c 
while Fig.[!Jb) is the AR bb for b jets withp T > 40 GeV/c. 
The relative height of the two peaks depends strongly on 
the pr requirement, but there is little difference between 
four- and five-jet events. The gluon-splitting contribu- 
tion is significantly suppressed by increasing the 6-jet pt 
requirement from 15 to 40 GeV/c. 

The scheme for creating a background sample was de- 
veloped in a relatively pure QCD multijet context. A 
"background" sample was constructed by adding the low- 
est pt jet from five-jet events to four-jet events. The two 
sources of jets were matched together to ensure compat- 
ible phase-space configurations. The leading jets in each 
sample were required to have a difference in px {Apt) 
within 1 GeV/c. Matches resulting in unphysical config- 
urations (e.g., spatially overlapping jets) were rejected. 
The background event statistics were enhanced by run- 
ning twenty times over the four- and five-jet samples. In 
each step the Apr requirement was relaxed by 1 GeV/c. 

One issue with this matching scheme is that an ini- 
tial four-jet event might not have sufficient phase space 



for an additional jet. Since QCD multijet events are not 
expected to contain significant missing transverse energy 
QSt), the presence of $t implies the presence of unrecon- 
structed or mismeasured jets which makes these events 
more suitable for use in the background sample. How- 
ever, badly mis-reconstructed events or events containing 
hard neutrinos can skew the phase space. Requiring the 
ratio of Ifir to Ht4 = Ylt=i VTi to be small reduced these 
contributions. Agreement between the "signal" (unadul- 
terated) and "background" five-jet samples was best with 
I$t > 5 GeV/c &ndI^T / Hta < 0.1. Variations in this ad- 
ditional phase space selection were included in the sys- 
tematic uncertainty evaluation 22 1. 

The resulting events were compared with the five-jet 
sample as illustrated in Fig. [HI Reasonable agreement 
was achieved with the individual jet pt distributions and 
with their sum. These manufactured background events 
are also compared against the five-jet events for several 
topological variables (defined in Sec. IIII E[) in Fig. [7] 

Both the original four-jet sample used to create these 
five-jet "background" events and the "signal" five-jet 
sample to which it was compared had little contamination 
from ti (0.2% and 0.7%, respectively), so this tests our 
ability to use one multijet sample to create a represen- 
tation of a higher-multiplicity sample. This scheme was 
extended to produce the background sample for events 
with six or more jets. In this case, the lowest pr jets 
were added to either four-jet (fifth and lower px jets) 
or five-jet (sixth and lower pr jets) samples. There was 
no reason to prefer the four-jet-initiated background over 
the one built from a five-jet sample. Instead, an equal 
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FIG. 6: Comparisons between the five-jet data and the background created from four-jet data for the pr distributions of each 
jet (pti through prs) and for Ht- The leading four jets were required to have pr > 40 GeV/c. Displayed error bars represent 
statistical uncertainties only. Distributions are normalized to unit area. 



mix of the two was used for the final background sample 
and the difference between the two separate background 
samples and the mixed sample was used when evaluat- 
ing systematic uncertainties. Variations between the two 
samples as a function of Ht are shown in Fig. [Sj Also 
shown is the change in the background due to systematic 
variations in the phase-space matching criteria described 
above. 



C. Signal Model 

The tt signal was simulated with the ALPGEN event 
generator. Two inclusive tt samples were used in this 
analysis: one with mt = 170 GeV/c 2 and one with 
TTii — 175 GeV/c 2 [HI]. PYTHIA, with the tune A param- 
eter settings, was used for the parton shower, hadroniza- 
tion, and underlying event aspects. The resulting events 
were processed through a GEANT [24| simulation of the 
DO detector and underwent the full reconstruction and 
analysis procedure. Information from data events se- 
lected by a random beam crossing trigger were overlayed 
on the simulated events to reproduce experimental con- 
ditions including detector noise and overlapping pp in- 



teractions. The instantaneous luminosity distribution of 
the simulated events was weighted to match that of the 
triggered data. 

Several additional corrections were applied to the sim- 
ulated events. First, the event generator used the lead- 
ing order (LO) parton distribution functions (PDF) from 
CTEQ6L1 (2f| [2^|. Events were reweighted to corre- 
spond to the CTEQ6.5M [27[ PDF. Second, the de- 
fault heavy-flavor fragmentation function in pythia was 
reweighted to one that described the LEP e + e~ data [28| . 
Additionally, the resolutions of reconstructed objects in 
the simulation were slightly better than those in the data, 
so the energies of jets, muons, and electrons were smeared 
to reproduce the resolutions observed in data [29]. The 
jet identification efficiency is slightly higher in the sim- 
ulation than in data. Therefore, jets in the simulation 
were randomly removed to make the efficiencies agree. 



D. Event Selection 

Selection criteria were applied to triggered events to 
minimize background while retaining a relatively high 
signal efficiency. The selection criteria, together with 
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FIG. 7: Comparisons between the five-jet data and the background created from four-jet data for variables used in the likelihood 
discriminant. The leading four jets were required to have pt > 40 GeV/c. Displayed error bars represent statistical uncertainties 
only. Distributions are normalized to unit area. 



the number of events after each cut, the cut efficiency 
e, and the cumulative selection efficiency e cum , are pre- 
sented in Table |TJ Values are given for the all-hadronic 
ti signal, for signal in all other tt decay channels, and for 
the data-based background. The signal fraction in the 
final selected sample corresponded to a purity of 12.5% 
(as found in Sec. IIV A[) . As the background was derived 
from triggered data, the minimum set of requirements 
on that sample, which also included a reconstructed pri- 
mary vertex with |zpv| < 35 cm and > 4 jets having 
Pt > 15 GeV/c, are listed as the second line in Table [J 



This corresponded to a starting signal-to-background ra- 
tio of approximately 1 : 7700. 

Events with isolated high-px electrons and muons were 
removed to avoid overlap with other DO tt cross section 
measurements [10, [HI ■ This requirement had little effect 
on the all-hadronic tt signal, but did remove a consider- 
able number of events from the background. 

Events considered in this analysis were required to 
have at least six jets. Each jet was required to be tag- 
gable, have px > 15 GeV/c, and \r]\ < 2.5. Further- 
more, at least four of the jets were required to have 
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FIG. 8: Systematic variations in the background sample with six or more jets as a function of Ht- (a) Comparisons with the 
background samples created using only four-jet (4+2) or five-jet (5+1) events, (b) Comparisons including one-sigma systematic 
variations in the phase-space matching criteria. The leading four jets were required to have pr > 40 GeV/c. Distributions are 
normalized to unit area. 



TABLE I: The number of events after each selection requirement. Each selection is inclusive of the ones above it. Shown are the 
criteria, the number of events that pass the selection, the efficiency of the selection (e), and the cumulative selection efficiency 
(£cum) for all-hadronic ti, all other ti decay channels, and the data-based background. The m t = 175 GeV/c 2 sample was 
used for the signal expectation. Signal and background numbers have been adjusted, using the 12.5% signal fraction measured 
in this analysis, to sum to the number of candidate events selected in the data. Statistical uncertainties are included for the 
overall signal efficiency. 





All Hadronic tt 


Other tt 




Background 


Approx. 
S:B 


Selection 


Num. e 


£cum 


Num. 




fcum 


Num. 




Scum 




(%) 


(%) 




(%o) 


(%) 




(%) 


(%) 


Total 


3024 100.0 


100.0 


3712 


100.0 


100.0 










Trigger, vertex, > 4 jets with pr > 15 GeV/c 


1663 55.0 


55.0 


773 


20.8 


20.8 


18856263 


100.0 


100.0 


1:7700 


Lepton veto 


1662 100.0 


55.0 


558 


72.2 


15.0 


12679185 


67.2 


67.3 


1:5700 


> 6 jets with p T > 15 GeV/c 


913 55.0 


30.2 


165 


29.6 


4.5 


1734595 


13.7 


9.2 


1:1600 


> 6 taggable jets with pr > 15 GeV/c 


628 68.8 


20.8 


60 


36.3 


1.6 


506277 


29.2 


2.7 


1:740 


> 2 b-tagged jets with pr > 40 GeV/c 


150 23.8 


4.9 


13 


21.8 


0.4 


2562 


0.5 


0.014 


1:16 


> 3 jets with p T > 40 GeV/c 


147 98.1 


4.9 


12 


95.2 


0.3 


2059 


80.4 


0.011 


1:13 


> 4 jets with pr > 40 GeV/c 


122 83.2 


4.0 


9 


70.3 


0.2 


920 


44.7 


0.0049 


1:7 


Efficiency 

Inclusive ti Efficiency 


(4.04 + 0.02)% 

(1.94 + 


(0.24 + 0.01)% 
0.01)% 





Pt > 40 GcV/c. At least two of these high-p^ jets were 
required to be b tagged. These additional jet require- 
ments improve the signal-to-background ratio by a factor 
of 100. 

In total, 1051 data events satisfy the selection cri- 
teria. The efficiency for all-hadronic ti events with 
m t = 175 GeV/c 2 is (_4.04 ± 0.02)% while the overall 
efficiency for inclusive ti events is (1.94 ± 0.01)% (statis- 
tical uncertainties only) . The equivalent efficiencies with 
m t = 170 GeV/c 2 are (3.65 ±0.04)% and (1.76 ± 0.02)%, 



respectively. Given these efficiencies and the standard 
model branching fractions, w 93% of the selected ti 
events are from the all-hadronic decay channel. The sur- 
viving leptonic ti events were primarily from the £+jets 
(w 60%) and r+jets (« 40%) decay channels. Few dilep- 
tonic events survived the full selection criteria (w 0.05% 
of ti). 

The expected signal-to-background ratio, given the 
12.5% signal purity extracted during the cross section 
measurement, is 1 : 7. 
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FIG. 9: Probability distributions for the variables input into the likelihood ratio. The signal distributions were extracted from 
the sample with mt = 175 GeV/c 2 . Displayed error bars represent statistical uncertainties only. 



E. Maximum Likelihood 

A likelihood discriminant based on topological observ- 
ables was constructed to separate the all-hadronic tt sig- 
nal from the multijet background. The likelihood ratio, 
L, for an event i is defined as 

L _ L s (i) 

L s {i) + L B {iy 

where 

n var 
k=l 



for signal and similarly for background. Here, Vs,k is the 
signal probability density function, normalized to unit 
area, for the fcth input variable Xk, and n va r is the number 
of variables. The TMVA [32[ package was used to build 
the probability distributions and the resulting likelihood 
ratio. 

The criteria for selection of observables to be input into 
the likelihood were: separation between signal and back- 
ground, reasonable agreement in the five-jet background 
validation, little correlation with other chosen variables, 
and little dependence on jet energies (to minimize sys- 
tematic uncertainty due to jet energy calibration). The 
following nine variables were used in the likelihood deter- 
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mination and are shown for simulated signal and data- 
based background events in Fig. [51 

C is the centrality defined as the scalar sum of jet 
Pt divided by the sum of jet energies; 

H' T is the scalar sum of jet px excluding the two high- 
est pt jets; 

B is the ratio of the dijet mass of the two leading 
6-tagged jets to the total mass of all the jets; 

Aa, A3 are the smallest two eigenvalues of the momen- 
tum tensor = J2iP?Pi Wi\ 2 where i 
runs over the number of jets and a, (3 = 1,2,3 
denote the three spatial components of the jet mo- 
menta [HI]; 

j/34 is the rapidity difference between the third and 
fourth leading jets; 

A234 is the pt asymmetry between the second and third 
jet and the fourth jet defined as {pt2 + PT3 — 
PT4,)/(PT2+PT3 +PT4); 

(yb) is the pT-weighted average of the rapidities of the 
leading two 6-tagged jets; 

(yi) is the pr-weighted average of the rapidities of the 
leading two light (not b tagged) jets. 

Comparisons are shown in Fig. [7] for these variables in 
the five-jet background validation sample. The com- 
bined probability distributions for signal and background 
are shown in Fig. 1101 The probability distributions and 
likelihoods were extracted independently for the m t = 
170 GeV/c 2 and 175 GeV/c 2 samples. 



IV. RESULTS 

A. Signal Fraction 

The signal and background likelihood templates were 
fit to the likelihood output, shown in Fig. [TU1 for the 
selected data events using tminuit (34| from ROOT [35| . 
Results from the fit are shown in Fig. [11] and are in agree- 
ment with the data. The measured signal fractions are 
(12.9 ±2.4)% for m t = 170 GeV/c 2 and (12.5 ±2.3)% for 
m t = 175 GeV/c 2 . Given 1051 data candidate events, 
this results in 136 and 131 tt events, respectively. Dis- 
tributions for the observables included in the likelihood, 
using the signal and background fractions from the fit, 
are shown in Fig. [T2] for m t — 175 GeV/c 2 . There is 
reasonable agreement between the data candidates and 
the sum of signal and background, normalized to the fit 
results. 

Jets in an event can be associated with the decays of 
individual top quarks. A \ 2 was constructed comparing 
the dijet masses with the W boson mass and the two bjj 
masses with each other. The combination with the lowest 
X 2 value was chosen. The results for the dijet mass and 
the bjj mass are shown in Figs. I13f c) and (d). There 
is good agreement between data and the sum of signal 
and background. The comparison is also made in a re- 
gion of phase space dominated by background (L < 0.2) 
and one which has a significantly larger signal fraction 
(L > 0.8), also shown in Fig.[l3l The distributions were 
not renormalized. Both the background-dominated and 
signal-enhanced distributions show reasonable agreement 
between data and the sum of signal and background. 
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FIG. 11: Comparison of the distributions of likelihood output values, L, for the selected data candidates (points) with those 
from the tt signal and data-based background samples. Signal and background were fit to the data candidates and are presented 
with a normalization equal to the fit purity times the number of candidate events for the signal. Displayed error bars represent 
statistical uncertainties only, (a) Signal sample with vat = 170 GeV/c 2 ; (b) mt = 175 GeV/c 2 . 



B. Systematic Uncertainties 

The effects of systematic uncertainties and variations 
in input variables were studied using ensemble tests. Ten 
thousand pseudo-experiments were run for each source of 
uncertainty. Each pseudo-experiment drew events from 
the systematically-shifted signal and background distri- 
butions and was fit using the standard signal and back- 
ground likelihood templates. With the exception of the 
two background-related systematics, all of the systematic 
uncertainties are associated with the signal simulation 
only. All systematic uncertainties on the tt production 
cross section measured with m t — 175 GeV/c 2 are sum- 
marised in Table HU Many of these are described in more 
detail in earlier sections of this paper. 

This analysis relies on ALPGEN+pythia for the tt sig- 
nal model used to determine the selection efficiency (Ta- 
ble U) and the kinematic shapes included in the likelihood 
determination (Fig. [9]). It is possible that the tt simula- 
tion does not properly reproduce the properties of the 
tt system. Other analyses in the lepton+jets and dilep- 
ton decay channels published by the DO collaboration 
have found good agreement between the simulation and 
the reconstructed data [5^, [3(| [U, Nevertheless, 
the simulation might mis-estimate the jet multiplicity 
through differences in the QCD radiation or the under- 
lying event. The measured fraction of reconstructed tt 
events (using the measured signal purities) with seven or 
more jets is 0.29 ± 0.04. The signal events were weighted 
up and down by one standard deviation in the statistical 



uncertainty on this ratio (15%). The entire analysis was 
repeated and the resulting difference in the mean cross 
section applied as a systematic uncertainty. The PDF 
in the simulation were also reweighted to corres pon d to 
CTEQ6.5M. The modified tolerance approach [27|, HI 
was used to estimate the effects of the PDF uncertainties 
on the measured cross section. Both of these uncertain- 
ties, along with those related to the reweighting of the 
heavy-flavor fragmentation function, luminosity profile, 
and vertex distribution; are listed as the signal model 
uncertainty in Table [TTJ 

The dominant sources of systematic uncertainty in the 
tt cross section measurement are the jet energy calibra- 
tion (10.8%), construction of the data-based background 
(10.7%), b tagging (9.2%), and jet taggability (8.8%). 
The total systematic uncertainty is 20.5%. 



C. Cross Section Measurement 

The cross section is defined as 

fN 
Le 

where / is the measured fraction of tt signal, N is the 
number of selected data events, C is the integrated lumi- 
nosity, and e is the inclusive tt efficiency given in Table HI 
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FIG. 12: Comparison between the data candidates and the sum of tt signal with m t = 175 GeV/c 2 and the data-based 
background for the variables used in the likelihood discriminant. Displayed error bars represent statistical uncertainties only. 



This results in the following cross sections: 
^itoGcV/c 2 = ? g ± 15 ^ gtat ^ ± 16 ^ ± Q 5 ^ lum ^ pb 

^lTSGcV/c 2 = g g ± 13 ^ gtat j ± L4 ^ ± q 4 ^ um ^ pb 

The statistical uncertainty includes the statistical uncer- 
tainties associated with the signal and background tem- 
plates. The latter was determined by re-fitting the data 
100,000 times while allowing the signal and background 
templates to vary according to their bin-to-bin statisti- 
cal uncertainties. The resulting uncertainties are sum- 
marized in Table [TTJ Figure [TJ] shows the SM prediction 
together with the measured cross section from this anal- 



ysis. The SM expectation [6( is in agreement with the 
measured cross sections. 



V. CONCLUSIONS 

We presented the inclusive ti cross section measured 
in 1 fb -1 of pp interactions at y/s = 1.96 TeV. The 
cross section was extracted using high-multiplicity jet 
events, specifically events with at least six jets, two of 
them b tagged. A model of the multijet background 
was created from lower jet-multiplicity data. A likeli- 
hood discriminant was used to separate signal from back- 
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FIG. 13: Distributions for the reconstructed W boson (top row) and top quark (bottom row) masses using the nit = 175 GeV/c 2 
signal sample. There are two entries per event. Displayed error bars represent statistical uncertainties only, (c, d) Result from 
the purity fit. (a, b) additionally requiring L < 0.2 to enhance background; (e, f) additionally requiring L > 0.8 to enhance 
signal. 



TABLE II: Uncertainties on the tt cross section categorized 
by source for the result corresponding to mt = 175 GeV/c 2 . 
The uncertainties with mt = 170 GeV/c 2 are similar. 



Source 


Uncertainty (%) 


Candidate statistics 


±18.5 


Background model 


±10.7 


Background model statistics 


±3.8 


Signal model 


±3.2 


Signal model statistics 


±0.5 


Trigger 


-2.0 ±3.9 


Jet identification efficiency 


-2.5 ±3.0 


Jet taggability 


±8.8 


Jet energy calibration 


±10.8 


Jet energy resolution 


-3.1 ±2.2 


b tagging 


-8.6 ±9.2 


Total statistical uncertainty 


±18.9 


Total systematic uncertainty 


±20.5 


Luminosity uncertainty 


±6.1 



ground. The cross section was obtained from a likelihood 
fit to the discrimant distribution and was measured to be 
7.9 ±2.2 pb assuming m t = 170 GeV/c 2 , and 6.9 ±2.0 pb 
assuming m t = 175 GeV/c 2 . Both results agree with the- 
oretical expectations. 
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